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AN ENZYME 

The present invention relates to an enzyme. The present invention also relates to a 
nucleotide sequence coding for the enzyme. 

Boos and coworkers in 1981 and 1982 (1.2) presented evidence for the existence of 
an enzyme capable of acetylating maltose via transfer of the acetyl group from 
Acetyl-coenzyme A to maltose in £. coli. In particular, Boos et al (I) observed the 
formation of acetyl-maitose and acetyl-oligomaltosides after accumulation of maltose 
or maltooligosides in coli. They also observed the formation of acetyl-maitose and 
acetyl-oligomaltosides in vitro when maltose or maltotriose, acetyl-coenzyme A and 
a cytosolic E. coli extract were mixed together Boos et al (2). 

Boos et al in 1981 stated that the activity responsible for maltose and makodextrin 
acetylation was unknown. However, in their further studies of 1982 (2), Feundlieb 
and Boos named the unknown enzyme "maltose transacetylase*' but then said that the 
function of maltose transacetylase in E. coli was unclear. 

Later Brand and Boos (3) isolated an E, coli mutant lacking the gene encoding 
maltose transacetylase. This mutant enabled them to map the gene at 10.4 min on the 
E. coli linkage map. In addition, they cloned a 3.4 kb DNA fragment containing the 
gene in a high copy plasmid. Over-expressed maltose transacetylase was then 
purified to homogeneity from cell free extracts of an E. coli strain harbouring the 
above mentioned plasmid. The enzyme was shown to be a homodimer with two 
identical subunits of 20 kDa. The km (mM) and Vmax (/xmol/min x mg enzyme) 
values of this enzyme for the substrates glucose, maltose and acetyl-coenzyme A were 
62 and 200, 90 and 110, and 0.018 and 166 respectively. Maltotriose and other 
oligosaccharides were found to be acetylated with a rate of 2% of the rate determined 
for glucose. In addition. Brand and Boos presented the following relative 
transacetylation rates: glucose 1, maltose 0.55, mannose 0.2, fructose 0.07, galactose 
0.04, maltotriose and other malto-oligosaccharides 0.02. Oligosaccharides are 
saccharides having less than ten sugar units. 



f ^ 
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Despite of these findings Brand and Boos did not sequence the enzyme or the 
nucleotide sequence coding for the maltose transacetylase enzyme. 

According to a first aspect of the present invention there is provided an enzyme 
5 having a{lA) glucan acetyl-transferase activity, wherein the enzyme comprises the 
amino acid sequence shown as SEQ ID No. 1, or a variant, homologue or fragment 
thereof. 

According to a second aspect of the present invention there is provided a recombinant 
10 enzyme having a(l,4) glucan acetyl-transferase activity, wherein the enzyme 
comprises the amino acid sequence shown as SEQ ID No. 1, or a variant, homologue 
or fragment thereof. 

According to a third aspect of the present invention there is provided a recombinant 
15 enzyme having a(l,4) glucan acetyl-transferase activity, wherein the enzyme has the 
amino acid sequence shown as SEQ ID No. 1. 

According to a fourth aspect of the present invention there is provided a recombinant 
enzyme having a(l,4) glucan acetyl-transferase activity, wherein the recombinant 
20 enzyme is immunologically reactive with an antibody raised against a purified 
recombinant enzyme according to the above-mentioned aspect of the present 
invention. 

According to a fifth aspect of the present invention there is provided a nucleotide 
25 sequence coding for the enzyme of the present invention or a sequence that is 
complementary thereto. 

According to a sixth aspect of the present invention there is provided a nucleotide 
sequence comprising the sequence shown as SEQ ID No. 2, or a variant, homologue 
30 or fragment thereof or a sequence that is complementary thereto. 
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According to a seventh aspect of the present invention there is provided a nucleotide 
sequence having the sequence shown as SEQ ID No. 2. 

According to an eighth aspect of the present invention there is provided a construct 
comprising or expressing the nucleotide sequence or the enzyme of the present 
invention. 



According to a ninth aspect of the present invention there is provided a vector 
comprising or expressing the construct or the nucleotide sequence or the enzyme 
according to the present invention. 

According to a tenth aspect of the present invention there is provided a plasmid 
comprising or expressing the vector, the construct or the nucleotide sequence or the 
enzyme according to the present invention. 

According to an eleventh aspect of the present invention there is provided a transgenic 
organism comprising or expressing the plasmid, the vector, the construct or the 
nucleotide sequence or enzyme according to the present invention. 

According to a twelfth aspect of the present invention there is provided a modified 
carbohydrate (preferably starch) prepared by a method comprising or expressing or 
using the present invention. 

The enzyme of the present invention may be obtainable from any one of a bacterium, 
a fungus, an alga, a yeast, or a plant. Preferably, the enzyme is obtainable from 
E.coli. 

The a(l,4) giucan acetyl-transferase of the present invention is sometimes referred 
to as Mac. The gene coding for the a(l,4) giucan acetyl-u-ansferase of the present 
invention is also sometimes referred to as the rmc gene. 
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According to a seventh aspect of the present invention there is provided a nucleotide 
sequence having the sequence shown as SEQ ID No. 2. 

According to an eighth aspect of the present invention there is provided a construct 
5 comprising or expressing the nucleotide sequence or the enzyme of the present 
invention. 

According to a ninth aspect of the present invention there is provided a vector 
comprising or expressing the construct or the nucleotide sequence or the enzyme 
10 according to the present invention. 

According to a tenth aspect of the present invention there is provided a plasmid 
comprising or expressing the vector, the construct or the nucleotide sequence or the 
enzyme according to the present invention. 

15 

According to an eleventh aspect of the present invention there is provided a transgenic 
organism comprising or expressing the plasmid, the vector, the construct or the 
nucleotide sequence or enzyme according to the present invention. 

20 According to a twelfth aspect of the present invention there is provided a modified 
carbohydrate (preferably starch) prepared by a method comprising or expressing or 
using the present invention. 

The enzyme of the present invention may be obtainable from any one of a bacterium, 
25 a fungus, an alga, a yeasi, or a plant. Preferably, the enzyme is obtainable from 
E. coli. 

The a(l,4) glucan acetyl-transferase of the present invention is sometimes referred 
to as Mac. The gene coding for the a(l,4) glucan acetyl-transferase of the present 
30 invention is also sometimes referred to as the mac gene. 
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Preferably, the enzyme comprises the amino acid sequence shown as SEQ ID No I, 
or a variant, homologue or fragment thereof. 

Preferably, the enzyme has the amino acid sequence shown as SEQ ID No I. 

Preferably, the enzyme is encoded by a nucleotide sequence comprising the nucleotide 
sequence shown as SEQ ID No 2, or a variant, homologue or fragment thereof or a 
sequence that is complementary thereto. 

Preferably, the enzyme is encoded by the nucleotide sequence shown as SEQ ID No 
2. 

Preferably, the organism is a plant. 

Preferably, the nucleotide sequence is a DNA sequence. 

The enzyme or nucleotide sequence(s) coding for same may be used in vitro or in 
vivo in combination with one or more other enzymes or nucleotide sequence(s) coding 
for same, which enzymes or nucleotide sequence(s) coding for same are preferably 
prepared by use of recombinant DNA techniques. 

Thus, according to one aspect of the present invention, an in vivo enzymatic 
modification process can be followed by an in vitro enzymatic modification process. 
In these modification steps, the enzymes used need not necessarily be the same 
enzymes. 

The terms "variant", "homologue" or "fragment" in relation to the enzyme include 
any substitution of, variation of, modification of, replacement of, deletion of or 
addition of one (or more) amino acid from or to the sequence providing the resultant 
amino acid sequence has a(I,4) glucan acetyl-transferase activity, preferably having 
at least the same activity of the enzyme shown as SEQ ID No. 1 . In panicular, the 
term "homologue" covers homology with respect to structure and/or function 
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providing the resultant enzyme has a(l,4) glucan acetyl-transferase activity. With 
respect to sequence homology, preferably there is at least 75%. more preferably at 
least 85%. more preferably at least 90% homology to the sequence shown as SEQ ID 
No. 1. More preferably there is at least 95%. more preferably at least 98%, 
5 homology to the sequence shown as SEQ ID No. 1 

The terms '"variant", "homologue" or "fragment" in relation to the nucleotide 
sequence coding for the enzyme include any substitution of, variation of, modification 
of. replacement of, deletion of or addition of one (or more) nucleic acid from or to 

10 the sequence providing the resultant nucleotide sequence codes for an enzyme having 
a(l,4) glucan acetyl-transferase activity, preferably having at least the same activity 
of the enzyme shown as SEQ ID No. 1. In panicular, the term "homologue" covers 
homology with respect to structure and/or function providing the resultant nucleotide 
sequence codes for an enzyme having a(I,4) glucan acetyl-transferase activity. With 

15 respect to sequence homology, preferably there is at least 75%, more preferably at 
least 85%, more preferably at least 90% homology to the sequence shown as SEQ ID 
No. 2. More preferably there is at least 95%, more preferably at least 98%, 
homology to the sequence shown as SEQ ID No. 2. 

20 The above terms are synonymous with allelic variations of the sequences, 

The term "complementary" means that the present invention also covers nucleotide 
sequences that can hybridise to uhe nucleotide sequence of the present invention. 

25 The term "nucleotide" in relation to the present invention includes genomic DNA, 
cDNA, synthetic DNA, and RNA. Preferably it means DNA, more preferably cDNA 
for the coding sequence of the present invention. 

Preferably the nucleotide sequence is not a native nucleotide sequence, In this 
30 regard, the term "native nucleotide sequence" means an entire nucleotide sequence 
that is in its native environment and when operatively linked to an entire promoter 
with which it is namrally associated, which promoter is also in its native environment. 
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Thus, the enzyme of the present invention can be expressed by a nucleotide sequence 
in its native organism but wherein the nucleotide sequence is not under the control of 
the promoter with which it is naturally associated within that organism. 

The enzyme of the present invention may be used in conjunction with other enzymes. 

Preferably the enzyme is not a native enzyme. In this regard, the term "native 
enzyme" means an entire enzyme that is in its native environment and when it has 
been expressed by its native nucleotide sequence. 

The term "construct" - which is synonymous with terms such as "conjugate", 
"cassette" and "hybrid" - includes the nucleotide sequence directly or indirectly 
attached or fused to a promoter. An example of an indirect attachment is the 
provision of a suitable spacer group such as an intron sequence, such as the Shi- 
intron or the ADH intron, intermediate the promoter and the nucleotide sequence. 

In each case, it is highly preferred that the terms do not cover the natural combination 
of the gene coding for the enzyme ordinarily associated with the wild type gene 
promoter and when they are both in their natural environment. One highly preferred 
embodiment of the present invention therefore relates to the nucleotide sequence of 
the present invention operatively linked to a heterologous promoter. 

The construct may even contain or express a marker which allows for the selection 
of the genetic construct in, for example, a plant, such as potato, into which it has 
been transferred. Various markers exist which may be used, such as for example 
those encoding mannose-6-phosphate isomerase (especially for plants) or those 
markers that provide for antibiotic resistance - e.g. resistance to G418. hygromycin, 
bleomycin, kanamycin and gentamycin. 

The term "vector" includes expression vectors and transformation vectors. 



The term "expression vector" means a construct capable of in vivo or in vitro 
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expression. 

The term "transformation vector" means a construct capable of being transferred from 
one species to another - such as from an E.coli plasmid to an Agrobacienum to a 
5 plant. 

The term "tissue" includes tissue and organ, which tissue and organ can be isolated 
tissue and isolated organ, as well as tissue and organ when within an organism. 

10 The term "organism" in relation to the present invention includes any organism that 
could comprise the nucleotide sequence coding for the enzyme according to the 
present invention and/or products obtained therefrom, and/or wherein the nucleotide 
sequence according to the present invention can be expressed when present in the 
organism. 

15 

Preferably the organism is a plant. 

The term "transgenic organism" in relation to the present invention includes any 
organism that comprises the nucleotide sequence coding for the enzyme according to 
20 the present invention and/or the products obtained therefrom, and/or wherein the 
nucleotide sequence according to the present invention can be expressed within the 
organism. Preferably the nucleotide sequence is incorporated in the genome of the 
organism. 

25 Preferably the transgenic organism is a plant. 

Therefore, the transgenic organism of the present invention includes an organism 
comprising any one of. or combinations of, the nucleotide sequence coding for the 
enzyme according to the present invention, constructs according to the present 
30 invention, vectors according to the present invention, plasmids according to the 
present invention, cells according to the present invention, tissues according to the 
present invention, or the products thereof. For example the transgenic organism can 
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also comprise the nucleotide sequence coding for the enzyme of the present mvention 
under the control of a heterologous promoter. The transgenic orgamsm does not 
comprise the combination of a promoter and the nucleotide sequence coding for the 
enzyme according to the present invention, wherein both the promoter and the 
5 nucleotide sequence are native to that organism and are in their naairal environment. 

The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase 
binding site in the Jacob-Mond theory of gene expression. 

10 The promoter could additionally include one or more features to ensure or to increase 
expression in a suitable host. For example, the features can be conserved regions 
such as a Pribnow Box or a TATA box. The promoters may even contain other 
sequences to affect (such as to maintain, enhance, decrease) the levels of expression 
of the nucleotide sequence of the present invention. For example, suitable other 

15 sequences include the Shi -inivon or an ADH intron. Other sequences include 
inducible elements - such as temperature, chemical, light or stress inducible elements. 

Also, suitable elements to enhance transcription or translation may be present. An 
example of the latter element is the TMV 5* signal sequence (see Sleat Gene 217 
20 [1987] 217-225; and Dawson Plant Mol. Biol. 23 [1993] 97). 

Thus, in one aspect, the nucleotide sequence according to the present invention is 
under the control of a promoter that allows expression of the nucleotide sequence. 
In this aspect, the promoter may be a cell or tissue specific promoter. If, for 
25 example, the orgamsm is a plant then the promoter can be one that affects expression 
of the nucleotide sequence in any one or more of seed, stem, tuber, sprout, root and 
leaf tissues. 

General teachings of recombinant DNA techniques may be found in Sambrook,!., 
30 Fritsch, E.F., Maniatis T. (Editors) Molecular Cloning. A laboratory manual. Second 
edition. Cold Spring Harbour Laboratory Press. New York 1989. 
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Even though the enzyme and the nucleotide sequence of the present invention are not 
disclosed in EP-B-0470145 and CA-A-2006454, those two documents do provide 
some useful background commentary on the types of techniques that may be 
employed to prepare transgenic plants according to the present invention. Some of 
5 these background teachings are now included in the following commentary. 

The basic principle in the construction of genetically modified plants is to insert 
genetic information in the plant genome so as to obtain a stable maintenance of the 
inserted genetic material. 

10 

Several techniques exist for inserting the genetic information, the two main principles 
being direct introduction of the genetic information and introduction of the genetic 
information by use of a vector system. A review of the general techniques may be 
found in articles by Potrykus (Annu Rev Plant Physiol Plant Mol Biol (1991) 42:205- 
15 225) and Christou (Agro- Food-Industry Hi-Tech March/ April 1994 17-27). 



Thus, in one aspect, the present invention relates to a vector system which carries the 
nucleotide sequence or construct according to the present invention and which is 
capable of introducing the nucleotide sequence or construct into the genome of an 
20 organism, such as a plant. 

The vector system may comprise one vector, but it can comprise two vectors. In the 
case of two vectors, the vector system is normally referred to as a binary vector 
system. Binary vector systems are described in further detail in Gynheung An et al. 
25 (1980), Binary Vectors, Plant Molecular Biology Manual A3, 1-19. 

One extensively employed system for transformation of plant cells with a given 
promoter or nucleotide sequence or construct is based on the use of a Ti plasmid from 
Agrobactenum lumefaciens or a Ri plasmid from Agrobacterium rhizogenes An et al. 
30 (1986), Plant Physiol. 81, 301-305 and Butcher D.N. et al. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J. P. Helgeson. 203-208. 
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Several different Ti and Ri plasmids have been constructed which are suitable for the 
construction of the plant or plant cell constructs described above. 

The nucleotide sequence or construct of the present invention should preferably be 
5 insened into the Ti-plasmid between the terminal sequences of the T-DNA or adjacent 
a T-DNA sequence so as to avoid disruption of the sequences inunediately 
surrounding the T-DNA borders, as at least one of these regions appear to be 
essential for insertion of modified T-DNA into the plant genome. 

As will be understood from the above explanation, if the organism is a plant, then the 
vector system of the present invention is preferably one which contains the sequences 
necessary to infect tne plant (e.g. the vir region) and at least one border part of a T- 
DNA sequence, the border pan being located on the same vector as the genetic 
construct. 

Furthermore, the vector system is preferably an Agrobacierium tumefaciens Ti- 
plasmid or an Agrobacierium rhizogenes Ri-plasmid or a derivative thereof, as these 
plasmids are well-known and widely employed in the construction of transgemc 
plants, many vector systems exist which are based on these plasmids or derivatives 
thereof. 

In the construction of a transgenic plant the nucleotide sequence or construct of the 
present invention may be first constructed in a microorganism in which the vector can 
replicate and which is easy to manipulate before insertion into the plant. An example 
25 of a useful microorganism is coli, but other microorganisms having the above 
properties may be used. When a vector of a vector system as defined above has been 
constructed in £. coli, it is transferred, if necessary, into a suitable Agrobacierium 
strain, e.g. Agrobacierium lumefaciens. The Ti-plasmid harbouring the nucleotide 
sequence or construct of the invention is thus preferably transferred into a suitable 
30 Agrobacierium strain, e.g. A. lumefaciens, so as to obtain an Agrobacierium cell 
harbouring the nucleotide sequence or construct of the invention, which DNA is 
subsequently transferred into the plant cell to be modified. 



10 



15 



20 
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As reponed in CA-A-2006454, a large amount of cloning vectors are available which 
contain a replication system in E. coli and a marker which allows a selection of the 
transformed cells. The vectors contain for example pBR 322, pUC series. M13 mp 
series, pACYC 184 etc. In this way, the nucleotide or construct of the present 
5 invention can be introduced into a suitable restriction position in the vector. The 
contained plasmid is used for the transformation in E.coli. The E,coli cells are 
cultivated in a suitable nutrient medium and then harvested and lysed. The plasmid 
is then recovered. As a method of analysis there is generally used sequence analysis, 
restriction analysis, electrophoresis and further biochemical-molecular biological 
10 methods. After each manipulation, the used DNA sequence can be restricted and 
connected with the next DNA sequence. Each sequence can be cloned in the same 
or different plasmid. 

After each introduction method of the construct or nucleotide sequence according to 
15 the present invention in the plants the presence and/or insenion of further DNA 
sequences may be necessary. If, for example, for the transformation the Ti- or Ri- 
plasmid of the plant cells is used, at least the right boundary and often however the 
right and the left boundary of the Ti- and Ri-plasmid T-DNA, as flanking areas of the 
introduced genes, can be connected. The use of T-DNA for the transformation of 
20 plant cells has been intensively smdied and is described in EP-A-120516; Hoekema, 
in: The Binary Plant Vector System Offset-drukkerij Kamers B.B., Alblasserdam, 
1985, Chapter V; Fraley, et ah, Crit. Rev. Plant Sci., 4:1-46; and An et al., EMBO 
J. (1985) 4:277-284. 

25 Direct infection of plant tissues by Agrobacterium is a simple technique which has 
been widely employed and which is described in Butcher D.N. et al. (1980), Tissue 
Culture Methods for Plant Pathologists, eds.; D.S. Ingrams and J. P. Helgeson, 203- 
208. For further teachings on this topic see Potrykus (Annu Rev Plant Physiol Plant 
Mol Biol [1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech March/ April 

30 1994 17-27). With this technique, infection of a plant may be done on a cenain pan 
or tissue of the plant, i.e. on a pan of a leaf, a tuber, a root, a stem or another pan 
of the plant. 
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Typically, with direct infection of plant tissues by Agrobacterium can7ing the 
nucleotide sequence of the present invention, a plant to be infected is wounded, e.g. 
by cutting the plant with a razor or puncturing the plant with a needle or rubbing the 
plant with an abrasive. The wound is then inoculated with the Agrobacterium, The 
5 inoculated plant or plant part is then grown on a suitable culture medium and allowed 
to develop into mature plants. 

When plant cells are constructed, these cells may be grown and mainuined in 
accordance with well-known tissue culturing methods such as by culturing the cells 
10 in a suitable culture medium supplied with the necessary growth factors such as amino 
acids, plant hormones, vitamins, etc. 

Regeneration of the transformed cells into genetically modified plants may be 
accomplished using known methods for the regeneration of plants from cell or tissue 
15 cultures, for example by selecting transformed shoots using an antibiotic and by 
subculturing the shoots on a medium containing the appropriate nutrients, plant 
hormones, etc. 

Even funher useful teachings on the transformation of plants can be found in Danish 
20 patent application No. 940662 (filed 10 June 1994) and/or United Kingdom patent 
application No. 9702592.8 (filed 7 February 1997). 

Reference may even be made to Spngstad et al (1995 Plant Cell Tissue Organ Culture 
40 pp 1-15) as these authors present a general overview on transgenic plant 
25 construction. 

In summation, the present invention relates to an enzyme having a(l,4) glucan acetyl- 
transferase activity and a nucleotide coding for same. The present invention also 
provides a modified carbohydrate (preferably starch) obtainable from use of the same. 

30 

The following sample was deposited in accordance with the Budapest Treaty at the 
recogmsed depositary The National Collections of Industrial and Marine Bacteria 
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Limited (NCIMB) at 23 St. Machar Drive, Aberdeen. Scotland. United Kingdom. 
AB2 IRY on 7 March 1996: 

5 DH5a-pMAC3 (which contains a 3.2 kb EcoRl-Pstl fragment from E. coli comprising 
the mac gene). 

The deposit number is NCIMB 40789. 
10 This deposit concerns the plasmid pMAC3. 

The following sample was deposited in accordance with the Budapest Treaty at the 
recognised depositary The National Collections of Industrial and Marine Bacteria 
Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland. United Kingdom. 
15 AB2 IRY on 7 March 1996: 

NF1830-pMAC5 (which contains the E.coli mac gene). 
The deposit number is NCIMB 40790. 

20 

This deposit concerns the plasmid pMAC5 . 

A highly preferred aspect of the present invention therefore relates to an enzyme 
having a(l,4) glucan acetyl-transferase activity, wherein the enzyme comprises the 
25 amino acid sequence shown as SEQ ID No. 1, or a variant, homologue or fragment 
thereof; and wherein the enzyme is expressed by a nucleotide sequence obtainable 
from either deposit number NCIMB 40789 or deposit number NCIMB 40790. 

Another highly preferred aspect of the present invention therefore relates to a 
30 nucleotide sequence comprising the sequence shown as SEQ ID No. 2, or a variant, 
homologue or fragment thereof or a sequence that is complementary thereto, and 
wherein the nucleotide sequence is obtainable from either deposit number NCIMB 
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40789 or deposit number NCIMB 40790. 

The present invention also provides a modified carbohydrate (preferably starch) 
obtainable from use of this same plasmid. 

5 

The present invention will now be described only by way of example in which 
reference is made to the following Figures: 

Figure 1 which shows the nucleotide sequence corresponding to SEQ ID No. 2; 

10 

Figure 2 which shows the amino acid sequence corresponding to SEQ ID No. 1; 

Figure 3 which shows a nucleotide sequence comprising the sequence 
corresponding to SEQ ID No, 2; 

15 

Figure 4 which is a plasmid map of pMACl; 
Figure 5 which is a plasmid map of pMAC2; 
20 Figure 6 which is a plasmid map of pMAC3; 
Figure 7 which is a plasmid map of pMAC5; 
Figure 8 which is a plasmid map of pMAC8; 

25 

Figure 9 which is a plasmid map of pMAC9; and 
Figure 10 which is a plasmid map of pMAClO. 
30 Some details on the Figures are as follows: 
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Figure 1 

Nucleotide sequence corresponding to Seq ID No 2 
Figure 2 

5 Amino acid sequence corresponding to Seq ID No i 
183 amino acids 
20073 MW 



Figure 4 

10 Plasmid name: pMACl 
Plasmid size: 7.26 kb 

Comments: Insertion of a 4.3 kb EcoRl fragment from lambda 151 into the EcoRl 
site of pBluescript II SK -f . 

15 Figure 5 

Plasmid name: pMAC2 
Plasmid size: 7.26 kb 

Conunents: Insertion of a 4.3 kb EcoRl fragment from lambda 151 (Kohara 
collection) into the EcoRl site of pBluescript II SK -f . 

20 

Figure 6 

Plasmid name: pMAC3 
Plasmid size; 7.26 kb 

Comments: Deletion of the 1.1 kb Pstl fragment from pMAC2. 

25 

Figure 7 

Plasmid name: pMAC5 
Plasmid size: 4060 bp 
Conmiems: 

30 The £ coli mac gene was amplified with primers: 
#B411 (upper primer with EcoRl site) 
CGGAATTCC GCC ATG AAG ACA TAC CC 
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#B412 (lower primer with Hindlll site) 

CAC AAG CTT ATT TTG CAT AAC AGT TGC 

using pMAC3 as template. 

The 704 bp PGR product was digested with EcoRl and Hindlll and insened in 
pUHE21-2 digested with the same restriction enzymes . 

Figure 8 

Plasmid Name: pMACS 
Plasmid size: 4935 bp 

Comments: The E coli mac gene was amplified with primers 

# B 478 CG G GAT CC G AGC ACA GAA AAA GAA AAG ATG (upper 
primer with BamHl site) 

# B 479 AA C TGC AG A TTT TGC ATA ACA GTT GC (lower primer with 
Psil site) 

and pMAC3 as template. The PGR product was digested with BamHl and Pstl and 
inserted in pBETP5 digested with the same enzymes. 

The SBE TP-mac fusion was control sequenced with primer # C028 

The 35S terminator-mac fusion was sequenced with primer # B456 og # C027. 

Figure 9 

Plasmid name: pMAC9 
Plasmid size: 9.37 kb 

Comments: Insenion of the 2294 bp EcoRl fragment (Patatin promoter-SBE TP-mac - 
35S terminator) from pMAC8 in the EcoRI site of pVicior IV Man. 
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Figure 10 

Plasmid name: pMAClO 
Plasmid size: 9.37 kb 

Comments: Insenion of a 2294 bp EcoRl fragment (Pataiin promoter-SBE TP-mac- 
5 35S terminator) from pMAC8 in the EcoRl site of pVictor IV Man. 

Cloning and sequencing of the mac gene from E. colt. 

Following, initially the teachings of Boos and Brand (3), the mac gene was isolated 
10 from the 4.3 kb EcoRl fragment from X phage 8C4 (151) from the Kohara collection 
(4). The fragment was inserted into the EcoRl site of plasmid pBluescript II SK ( + ) 
in both orientations yielding plasmids pMACl and pMAC2 (Figures 4 and 5). When 
harboured in E. colt these plasmids gave rise to highly elevated maltose 
acetyltransferase levels indicating that the 4.3 kb EcoRl fragment contains the mac 
15 gene, 

In order to localise the mac gene on the 4.3 kb EcoRl fragment, the 1.1 kb Pstl 
fragment was deleted from plasmid pMAC2. This plasmid construction pMAC3 
(Figure 6) also gave rise to increased maltose acetyltransferase levels in strains 
20 containing this plasmid, thus demonstrating that the mac gene is present on the 3.2 
kb EcoRl-Pstl fragment. 

The nucleotide sequence of the 3.2 kb EcoRl-Pstl insen in pMAC3 was then 
determined by automated sequencing on an A.L.F. sequencer. The 3137 bp DNA 
25 sequence revealed a 372 bp region of the 3' end of the E. coli acrB gene and three 
open reading frames potentially encoding proteins of 124, 126, and 183 amino acids 
(Figure 3). 

In accordance, "S-methionine labelling experiments with £. coU minicells containing 
30 pMAC3 showed the synthesis of proteins having molecular weights corresponding to 
these sizes. 
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The 183 codon orf which encodes a protein of a predicted molecular weight of 20073 
(Figure 2) is the mac gene, since the £. coli maltose acetyl-transferase has an 
estimated subunit molecular weight of 20.000 (3). 

5 Over-expression of the Mac enzyme in E. colL 

In order to purify the Mac enzyme, the mac gene was inserted after an 
isopropylthiogalactosidase (IPTG) inducible phage TV-promoter Al in pUHE21-2 to 
give pMACS (Figure 7). Cultures of E coli strain NF1830 (MClOOO. recAl. 
10 lacIqlZ::tm5, a gift from Niels Fiil, University of Copenhagen) harbouring pMAC5 
was found to have highly elevated levels of maltose acetyltransferase, when 
expression of the mac gene is induced by addition of IPTG to the growth medium. 

Growth Conditions 

15 

A 1 L LB culture of NF1830-pMAC5 supplemented with ampicillin (100 fig/ml) and 
kanamycin (25 fig/ml) was grown at 3T'C with vigorous shaking until the A6(X) 
reached 0.7. IPTG was added to a final concentration of 2niM and growth was 
continued for four hours. The ceils were harvested by centrifugation (10 min. at 4 

20 000 X g) and washed by resuspension in 200 ml 0.9% NaCl. The cell pellet was then 
resuspended in 250 ml 20 mM potassium phosphate pH 7.5 containing 0.4 noM 
PMSF, 0.4 mg/ml pepstatin and 1.6 mM EDTA. The suspension was sonicated 5 x 
1 min. using a Vibra Cell VC 600 with a 19 mm High Gain Horn and extender (all 
from Sonics and Materials Inc., USA). The homogenate was clarified by 

25 centrifugation for 60 min. at 90 000 x g at 4°C and subsequent filtration through a 
0.22 /xm filter. 



wo 97/33974 PCT/EP97/01 117 

19 

Purification of Recombinant Mac 



The resulting crude extract was applied to a Q-Sepharose 26/10 column t Pharmacia 
Biotech) equilibrated with 20 mM potassium phosphate pH 7,5 (hereinafter called 

5 "buffer A") at a flow rate of 2 ml/min. The column was washed with 300 ml of 
buffer A and the bound protein was eluted by applying a 0 to 0.3 M NaCl linear 
gradient in buffer A (300 ml). The fractions containing enzyme activity were pooled 
and applied to a 8 ml Affi-Gel Blue (Biorad) column (16 mm x 26 cm) equilibrated 
with buffer A at a flow rate of 1 ml/min. The column was washed with 50 ml of the 

10 same buffer containing 0.4 M NaCl. The enzyme was then eluted with the same 
buffer containing 2 M NaCl. The active pool was dialysed overnight against buffer 
A and subsequently concentrated to approximately 3 ml in a Centriprep-30 (Amicon). 
This fraction was applied to a 6 ml Aceiyl-coA-Minileak column equilibrated with 
buffer A at a flow rate of 0.3 ml/min. This affinity resin was made by coupling 200 

15 mg of Acetyl-coA to 5 g (dry weight) of Minileak High (Kem-En-Tek, Denmark) in 
10 ml of 1 M NaCOj pH 11 for 20h at room temperamre. The column was washed 
with 20 ml of buffer A. It was then turned upside down and the pure enzyme was 
eluted in less than 20 ml with buffer A containing 0.5 M NaCl. 

20 The purification of the maltose acety transferase to homogeneity was achieved after 
three chromatographic steps. From 11 culture we were able to get 5.8 mg pure Mac. 
The yield was 29% and the enzyme was purified 80-fold. The purity of the enzyme 
was assessed both by SDS-PAGE and mass spectrometry. The latter revealed a 
molecular mass of 19,982 Da. 



25 
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Determination of enzyme concentration and activity 

The concentration of pure Mac solutions was estimated spectrophotomeirically at 280 
nm using an extinction coefficient of 0.66 as determined from the amino acid 
5 composition of Mac according to (5). The acetyl-transferase activity of Mac was 
assayed spectrophotometrically according to a modified Alpers' assay (6). A Perkin 
Elmer Lambda 18 spectrophotometer was used. The assay mixture of a total volume 
of 1 ml contained a 50 mM potassium phosphate, 2 mM EDTA buffer at pH 7.5, 100 
^1 of maltose IM, 100 ^\ of Acetyl-coA 0.4 mM, 10 ^1 5,5'-dithiobis(2-nitrobenzoic 
10 acid) (DTNB) 40 mM dissolved in methanol and 10 /xl enzyme. The reaction was 
staned by the addition of enzyme or maltose and was monitored at 412 nm at 25°C. 
One activity unit was defined as the amount of enzyme that produced an increase in 
absorbance of 1 per minute at 25°C. An extinction coefficient of 13 600 M"' x cm ' 
was used for DTNB in order to calculate the consumption of acetyl coenzyme A. 

15 

N-terminal sequencing of Recombinant Mac 

N-lerminal sequencing of pure Mac was performed using an Applied Biosystems 
476A protein sequencer. One nanomole of protein was desalted by RP-HPLC on a 
20 C2 column (4.6/30) prior to loading onto the sequencer. The N-terminal sequence 
of Mac was determined up to residue was determined up to residue 48 and was in 
complete concordance with the nucleotide sequence of the mac gene (Figure 1). 
Furthermore, the N-terminal methionine residue was not present on the mature 
protein (Figure 2). 

25 

Production of polyclonal antibodies against Recombinant Mac 

Rabbits were immunised subcutaneously at 2-week intervals during 6 weeks and at 
4-week intervals thereafter with 90 fig of pure protein emulsified (1:1, vol/vol) with 
30 Freund's adjuvant. Antisera were tested against Mac in immunoblots and were found 
highly specific. 
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Characterisation and activity profile of recombinant Mac 

Mass spectrometry studies indicated that Mac may be a trimer. 

5 The isoelectric point of Mac was determined by isoelectric focusing on a PhastGel 
lEF 4-6.5 (Pharmacia) and was found to be 5.7. 

The pH profile of Mac was investigated between pH 5 and 8.5 at a 100 mM maltose 
concentration in 50 mM buffers containing 100 mM NaCl. Under these conditions. 
10 the pH optimum was 7.7. 

The pH stability of Mac was examined at 25 "C between pH 3.0 and 10.0. Mac was 
instantaneously inactivated at pH 3.0 but was stable between pH 4.0 and 10.0 for at 
least six hours. 

15 

The thermostability of Mac was investigated at pH 7.5 between 40 and 70'C. After 
incubation for four hours at 40''C and 50°C, the remaining activity of Mac was 100% 
and 75%, respectively. Its half-life was 70 min, and 22 min at 60°C and 70°C, 
respectively. 

20 

The substrate preference of Mac towards the carbohydrate acetyl-acceptor substrate 
was investigated by measuring the initial rate of the acetylation of various 
carbohydrates (used at 50 and 100 mM concentrations) following the procedure 
described in "Determination of enzyme concentration and activity". The results are 
25 presented in Tables 1, 2 and 3. Among the monosaccharides tested, glucose was the 
best substrate and among the disaccharides tested, maltose and isomaltose were the 
best substrates. 
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Table 1. Comparison of the relative activity of Mac towards various 
monosaccharides as acetyl-acceptors. 





Relative Activity 




(% of activity on glucose) 


Glucose 


100 


Mannose 


38 


Fructose 


17 


Galactose 


0.9 



Table 2. Comparison of the relative activity of Mac towards various disaccharides 
as acetyl-acceptors. 



15 


Substrate 
(100 mM) 


Relative Activity 

(% of activity on maltose) 




Maltose (Qr-glucose(l,4) a-giucose) 


100 




Isomaltose (a-giucose(l,4) a-glucose) 


110 




Lactose (/3-gaiactose j3-(l,6) a-glucose) 


0.4 


20 


Sucrose (a-glucose a-(l,4) |8-fructose) 


0.4 




Cellobiose (jS-glucose /J-(l,4) /3-glucose) 


0 
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Table 3. Comparison of the relative activity of Mac towards various 
maltooligosaccharides as acetyl-acceptors. 





Substrate 


Relative Activity 


5 


(50 mM) 


(% of activity on maltose) 




Maltose 


100 




Maltotriose 


7.5 




Maltotetraose 


0.5 




Maltopentaose 


0.9 


10 


Maltohexaose 


1.2 




Maltoheptaose 


1.1 



Kinetic studies 

15 

Kinetic studies of Mac catalysed acetylation reactions revealed that the Km for the 
acceptor substrate is in the mM range whereas it is in the fiM range for 
acetyl-coenzyme A. Thus, Mac has about a 1000 fold more affinity for 
acetyl-coenzyme A than for the acceptor. 

20 

NMR studies 

'H-NMR strucmre determination of the products of acetylation of glucose and maltose 
by Mac was investigated. 

25 

In order to investigate the substrate regio-specificity of Mac regarding the acetylation 
site of the acceptor substrate, we prepared miUigram amounts of acetylated glucose 
and maltose by incubating 10 mg of glucose or maltose with £. coli Mac and 1 mg 
acetyl-coenzyme A in phosphate buffer at pH 7.5 for 48 hours. Additional aliquots 
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of I mg acetyl-coenzyme A were added during the incubation. The reaction products 
were separated by thin layer chromatography and the acetylated glucose and maltose 
were isolated from the chromatogram and freeze dried. The structures of these 
acetylated sugars were determined by ^H-NMR. Glucose was only acetylated at the 
5 C6 position, and maltose was acetylated at the C6 position of its non-reducing glucose 
moiety. These results reveal that Mac acetylates hexoses at their C6 position. 

Activity of the SBE-Mac fusion in E. coli. 

10 Because the 27 amino acid SBE portion of the SBE-Mac fusion in pMAC9 and 
pMAClO described below may interfere with the acety transferase activity, the 
SBE-Mac fusion was inserted in the £. coli expression vector pAL781 (Invitrogene, 
San Diego. USA) in order to over-express the fusion enzyme in E, coli and analyse 
the activity. A comparison of the highly over-expressed SBE-Mac fusion and the 

15 purified wild type Mac enzyme on SDS gels showed that the fusion migrated slightly 
slower due to the 27 amino acid extension. Moreover, the fusion retained the ability 
to use maltose as a substrate for acetylation. Thus, the fusion enzyme appears to be 
intact and is fully active in E. coli. Therefore, it may be assumed, that the SBE-Mac 
fusion enzyme will be active in potatoes. 

20 

IN VIVO MODIFICATION OF STARCH IN POTATO 

General teachings on potato transformation may be found in our copending patent 
applications PCT/EP96/03053, PCT/EP96/03052 and PCT/EP94/01082 (the contents 
25 of each of which are incorporated herein by reference). 



For the present studies, the following protocol was adopted. 
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Construction of plasmids for the expression of the E coli mac gene in potato. 

The E coli mac gene was amplified with primers: 

5 5'-CGG GAT CCG AGC AC A GAA AAA GAA AAG ATG-3^ (upper primer with 
BamHl site) 

and 

10 5'-AAC TGC AGA TTT TGC ATA ACA GTT GC-3^ (lower primer with Pstl site) 
and pMAC3 as template. 

The PGR product was digested with BamHl and Pstl and inserted in pBETPS (see 
15 PCT patent application No. WO 94/24292, the contents of which are incorporated 
herein by reference) digested with the same enzymes yielding pMAC8. Thereby, the 
mac gene is inserted in an expression cassette that provides tuber specific expression 
from a patatin promoter and transcription termination at a CaMV 35S terminator. 
Moreover, the Mac enzyme is fused to 102 amino acids of the N-terminus of the 
20 potato starch branching enzyme including a 75 amino acid transit peptide that directs 
the mac gene product to the potato tuber amyloplasts. Upon import to the amyloplast 
the 75 amino acid transit peptide is cleaved off, to give a Mac fusionprotein that has 
the 27 amino acids from the mature starch branching enzyme N-terminus. The 2294 
bp EcoRl expression cassette was isolated from pMAC8 and inserted in the EcoRl site 
25 of the plant transformation vector pVictor IV Man (see PCT patent application No. 
WO 94/24292 and British patent application No. 951443.8, the contents of each of 
which are incorporated herein by reference) giving plasmids pMAC9 and pMAClO 
(Figures 9 and 10, respectively). 
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Preparation of potato minitubers 

A segment containing the nodium - i.e. a segment taken from 2 mm above and 5 mm 
below the nodium - was cut from in vitro grown potato plants or mannose selected 
5 shoots (for mannose selection see our earlier patent applications WO 93/05 163 and/or 
WO 94/20627). The leaf was removed from the nodium segment, and the segment 
was placed vertically on agar plates with MS medium (Sigma) supplemented with 60 
g sucrose/1 and 2 mg 6-benzyl-aminopurine/l. The nodium segments were grown for 
7 days at 20°C with a 16 hour light period and an 8 hour dark period. Subsequently, 
10 the plates were wrapped in alu-foil and placed in the dark at 20"C. The minitubers 
were harvested after about 28 days and applied for western analysis in order to detect 
Mac expression. 

Expression of the SBE-Mac fusion in potato minitubers 

15 

Potato minitubers transformed with the pMAC9 or pMAClO constructs were 
examined by Western analysis for expression of the £. coli mac gene with antibodies 
raised towards the E. coli maltose acety [transferase. The analysis clearly 
demonstrated that 3 out of 5 MAC9 minimbers and 5 out of 7 MAC 10 minitubers 

20 gave a distinct expression of the £. coli maltose acetyltransferase. The positive 
minitubers expressed a 209 amino acid SBE-Mac fusion that co-migrates with a 
similar construction expressed in E. coli. These results indicate that the 75 amino 
acid SEE transit peptide, that was originally fused to the 209 amino acid SBE-Mac 
fusion, has been removed from the SBE-fusion. Furthermore, this implies that the 

25 transit peptide was correctly processed by the signal peptidase in the amyloplast 
membrane, and that the SBE-Mac fusion has been directed to the amyloplast. 
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Immunoblots on potato tuber extracts 

0.5 ml potato protein extract was precipitated with 20% TCA tor 30 min on ice. 
Protein precipitates were recovered after centrifugation and resuspended in 50 ^1 of 
5 SDS-PAGE sample buffer. 25 ^1 were subsequently loaded onto 15 % 
polyacrylamide gels. After electrophoresis proteins were transferred onto Problot 
PVDF membranes by semi-dry blotting. For immunodetection Mac antiserum was 
diluted 1;2 000 and secondary antibody was coupled to alkaline phosphatase. 

10 In accordance with the Western Blot analysis of the minitubers described above, the 
western analysis of the transgenic mbers clearly demonstrated that the 209 a SBE-Mac 
fusion is expressed in the tubers. 

Analysis of potato tubers for Mac activity 

15 

Potato tubers of comparable sizes were chosen and cut into pieces and homogenised 
in extraction buffer and Dowex (1%. w/vol) using a mortar and pestle or an electric 
blender. 5 ml extraction buffer (50 mM potassium phosphate pH 7.5, 2 mM EDTA. 
0.5 mM PMSF) was used per gram potato. The mixmre was allowed to stand on ice 
20 for 30 min and the insoluble material was removed by centrifugation. Protein 
concentration was measured using the BCA reagent (Pierce). 

Mac activity was measured in duplicates or triplicates as follows: 0, 50, 100 or 200 
Ml potato extract. 10 (xl of 1 mM acetyl-coenzyme A, 25 ^1 of 1 M glucose and assay 

25 buffer (50 mM potassium phosphate, 2 mM EDTA, pH 7.5) were mixed per 
microtiter plate well to give a total volume of 250 fi\. The reaction was started by 
the addition of acetyl-coenzyme A. After 10 min. reaction at room temperature, 25 
/il of freshly made 4 mM DTNB was added and A^os was measured immediately. 
Two wells were prepared for each single assay, one with glucose and one without. 

30 Activity was calculated by subtracting the absorbance of the well without glucose 
(background absorbance) from that of the well with glucose. 
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Relatively high levels of Mac activity could be measured in eight out of nine 
transgenic tubers. Some of the tubers had a Mac activity that was 15 to 20 fold 
above the almost negligible activity found in non-transformed njbers. 

5 Viscometric studies 

Samples of starch obtained from tubers of non-transformed potatoes and from 
transformed potatoes according to the present invention were analysed by 
viscoamylograph of an aqueous suspension using a Newport Scientific Rapid Visco 
10 Analyser 3C. The results showed that the starch from the transformed potatoes had 
a different viscometric profile to the starch from the non-transformed potato. 

DSC studies 

15 Samples of starch obtained from tubers of non-transformed potatoes and from 
transformed potatoes according to the present invention were analysed by differential 
scanning colometry (using a 10% w/w aqueous starch suspension). The samples were 
heated from 20 to 100°C at a velocity of 10°C per minute. The results showed that 
the starch from the transformed potatoes had a different enthalpy to the starch from 

20 the non-transformed potato. We additionally found a difference in gelatinisaiion 
temperamre for the transformed potatoes compared to the starch from the non- 
transformed potatoes. 



Other modifications of the present invention will be apparent to those skilled in the 
25 art. 
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SEQUENCES 

SEQUENCE [Q NO 1 
5 Amino acid sequence 

MSTEKEKMIAGELYRSADETLSRDRLRAROL [HRYNHSLAEEHTLRQQl L 50 
ADLFGOVTEAY ! EPTFRCDYGYN I FLGNNFFANFDCVMLDVCP I RIGDNC 1 00 

10 

MLAPGVH I YTATHP I DP VARN5GAELGKPVT IGNN VW I GGRAV I NPG VT I 1 50 
GDNVVVASGAVVTKDVPDNVVVGGNPARIIKKL 193 



15 




0 m 2 








Nucleotide 


sequence 








ATG/,GCACAG 


AAAAAGAAAA 


GATGATTGCT 


GGIGAGHGT 


20 


ATCGCTCGGC 


AGATGAGACG 


HATCTCGCG 


ATCGCCTGCG 




CGCTCGTCAG 


CnATTCACC 


GATACAATCA 


TTCCCTGGCG 




GAAGAGCACA 


CAHACGCCA 


GCAAAHCTC 


GCTGATCTAT 




TCGGTCAGGT 


GACAGA6GCT 


TATAHGAGC 


CAACGTHCG 




CTGTGACTAT 


GGCTATAACA 


TTTTTCTCGG 


TAATAAl 1 1 1 


25 


TTCGCCAACT 


TCGAHGCGT 


GATGCHGAT 


GTCTGCCCTA 




TTCGCATCGG 


TGATAACTGT 


ATGTTGGCAC 


CAGGCGHCA 




TATCTACACG 


GCAACACATC 


CCATCGACCC 


TGTAGCACGT 




AATAGCGGTG 


CTGAACTGGG 


GAAACCCGTC 


ACCATCGGTA 




ATAACGTCTG 


GA'^TGGCGGA 


CGCGCGGTCA 


TT.AACCCTGG 


30 


TGTGACCATT 


GGTGATAACG 


TCGTGGTAGC 


CTCAGGTGCA 




GTTGTCACAA 


AA3ATGTCCC 


GGACAACGH 


GTCGTGGGCG 




GTAATCCAGC 


CAGAATAATT 


AAAAAATTGT 


a;\ 
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SFQIJENCE I D NO 3 
Nucleotide sequence 

Complete nucleotide sequence of the 3.2 kb EcoRl-Pstl fragment in pMAC3 

5 

GAATTCGCCA AAGACTTGAT GGATAAAGAA GGTAAAGGTC TGATTGAAGC GACGCTGAT 60 

GCGGTGCGGA TGCGnTACG TCCGATCCTG ATGACCTCGC TCGCGTHAT CCTCGGCGTT 120 

ATGCCGCTGG TTATCAGTAC TGGTGCTGGT TCCGGCGCGC AGAACGCAGT AGGTACCGGT 180 

GTAATGGGCG GGATGGTGAC CGCAACGGTA CTGGCAATCT TCnCGTTCC GGTATTCTTT i40 

10 GTGGTGCnC GCCGCCGCTT TAGCCGCAAG AATGAAGATA TCGAGCACAG CCATACTGTC 300 

GATCATCATT 6ATACAACGT GTAATCACTA AGGCCGCGTA AGC6GCCTTT TITATGCATA 360 

ACCTACGAAC ATTAAGGAGT AATTGAACCA CCAACTCAGG ATCTCATACG AAAACCAGTA 420 

TTAACCACGG ATAAAATTCA TAAAAAATAC TGATTGHAG TTAATHATA TTAAGTAGCG 480 

CTAATAGATT TAATAATCCA TAATCATTTA GAGGCTATTC TTAATTATn GCGGTAATTC 540 

15 TTTATTCATT CCTCGGTTAT TAC6TCATAT TCAGAGCAAT CCTGGTATTA GTGTCACCAA 600 

TTTCATCTGG CGATAATCCT GAAATGTTAT GAATAGTTCG AGCAAACTGC TTTTACCTGC 660 

TGCGGGTTAG TGCTAGTATG .AAAAAGTGAG TCCTGTCCCG CnCCTTCCT AATTGTMTT 720 

TTTCGTAATA ATGCGATG.AA ^ACCTGCAAA GAGTGGCTTA TAGTTAAGCT AACAAACGAG 780 

AGGGCAAGTC CAGGTCAGTA AGTTTTnCC ATCCCGAAAG GTGTCCGTTA GTTCAACCGC 840 

20 TAAGAAG6GG ACGCGTTATG GATGAATACT CACCCAAAAG ACATGATATC GCACAGCTTA 900 

AGTTTCTCTG TGAAACCCTG TATCAT6ACT GCCHGCAAA CCHGAAGAA AGCAATCATG 960 

GCTGGGTAAA CGACCCAACC TCGGC6ATCA ACCTCCAGH GAATGAACTG ATTGAGCATA 1020 

TTGCGACCTT CGCACTTAAT TACAAAATTA AGTATAATGA AGACAATAAG CTCAHGAGC 1080 

AGATCGACGA ATATCTGGAT GACACCHTA TGHGnCAG TAGTTATGGT ATTAATATGC 1140 

25 AGGATCTTCA GAAATGGCGG AAGTCAG6TA AHCGACTATH CCGTTGTTTT GTCAATGCGA 1200 

CGAAAGAGAA TCCTGCGAGT HATCncn AGAAnATTA CAACCATAGG TAGAAGTATG 1260 

TCCGAAAAAC CTTTAACGAA AACCGATTAT TTAATGCGH TACGTCGTTG CCAGACAATT 1320 

GACACGCTGG AGCGGTTTAW TCGAGAAAAA TAAATACGAA HATCAGATA ATGAACrCGC 1380 

GGTATHTAC TCAGCCGCAG ATCACCGCCT CGCCGAATTG ACCATGAATA AACTGTACGA 1440 

30 CAAGATCCCT TCCTCAGTAT GGAAATTTAT TCGCTAATAA ATAATTCGCT TTCGGAGCTA 1500 

TAACCGGCTG mAHAAGA ATTTTATACT TTTTCGCCAT GAAGACATAC CCTATGTGAT 1560 

CTTTATCACA CAGATGTAAT GGGAACGTTC TCHCACTGA CTTTTCGTCT TACTGIGHG 1620 

CCGCATTTTC AGCAACCGGA GTCAGTAATG AGCACAGAAA AAGAAAAGAT GAHGCIGGT 1680 

GAGTTGTATC GCTCGGCAGA TGAGACGTTA TCTCGCGATC GCCTGCGCGC TCGTCAGCTT 1740 

35 ATTCACCGAT ACAATCATTC CCTGGCGGAA GAGCACACAT TACGCCAGCA AAHCTCGCT 1800 

GATCTAHCG GTCAGGT6AC AGAGGCHAT ATTGAGCCAA CGTTTCGCTG TGACTATGGC 1860 

TATAACATTT TTCTCGGTAA TAATTTTnC GCCAACnCG AHGCGTGAT GCTTGATGTC 1920 

TGCCCTAHC GCATCGGTGA TAACTGTATG HGGCACCAG GCGHCATAT CTACACGGCA 1980 

ACACATCCCA TCGACCCTGT AGCACGTAAT AGCG6TGCTG AACTGGGGAA ACCCGTCACC 2040 

40 ATCGGTAATA ACGTCTGGAT TGGCGGACGC GCGGTCAnA ACCCTGGTGT GACCATTGGT 2100 

GATAACGTCG TGGTAGCCTC AGGTGCAGTT GTCACAAAAG ATGTCCCGGA CAACGTTGTC 2160 

GTGGGCGGTA ATCCAGCCAG AATAAHAAA AAAHGTAAT CGGTTTTTCG CAACTGHAT 2220 

GCAAAAHGT GGTAGATCT6 TTACTTCCCC TCTACTAHC CCACGTTAAA ATAGGGTGTT 2280 

CCCTGGAAAG HGCAGATAC CACGAAGGCA AACGATGACC GAAATACAAC GCCTGCTGAC 2340 

45 CGAAACGAH GAGTCTCTGA ATACCCGCGA AAAACGCGAC AACAAACCCC GCTTTAGTAT 2400 
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CAGTHTATC CGTAAACATC CGGGGCTGTT TATCGGTATG TACGTTGCTT TTTHGCCAC 2460 

CCTGGCGGTG ATGHGCAGT CCGAAACGCT GTCAGGCTCT GTCTGGCTAC TGGnGTATT 2520 

ATTTATCCTG CTTAATGGTT TCTTCTTTTT CGATGTCTAC CCACGCTACC GCTATGAAGA 2580 

TATCGACGTG CTGGATTTCC GCGTTTGCTA TAACGGCGAA TGGTACAACA CGCGCTTTGT 2640 

5 ACCTGCCGCG CTGGTTGAAG CCATCTTGAA CTCTCCGTGT CGCGGATGTT CATAAGGAAC _700 

AACTGCAAAA AATGATCGTC CGTAAAGGTG AACTGTCTTT TTACGATATT TTTACCCTCS 2760 

TCGCGCCGAA TCAACATCTT AAGTTAGGGT TACATACCAG GCGTAAAGCT CTGCGCCTGG 2920 

TCAAATGACA ATGATCGTTT CCACCCATCA CTTCATGAAA TACCAGCTCT ACCTCCHAT 2880 

CTCCAGCCAG CCTTTnCCA CAATCAGATA TACTTTCCCT ACACTGTGTT AATAAG6ATA 2940 

10 TGCTGGTGAG AACACGACAT CTGGTCGGCC TTATTTCGGG A6TACTGATT CTHCAGTA: 3000 

TGCTGCCTGT CGGCTTAAGC ATCTGGCTGG CCCATCAGCA GGTAGAAACA TCGITTAHG 3060 

AAGAGCTGGA TACCTAHCC TCCCGCGTCG CTATTCGAGC CAATAAGGTG GCGACACAAG 3120 

GGAAAGATGC GCTGCAG 3137 

15 SEQUENCE ! D NO 4 

Complete nucleotide sequence of the 3 2 kb EcoR\-Fst\ fragment m pMAC3 The Mac 
enzyme amino acid sequence u also shown below the mac gene coding sequence. 

20 GAAnCGCCAAAGACTTGATGGATAAAGAAGGTAMGGTCTGATTGAAGCGACGCTTGAT 60 

GCGGTGCGGATGCGTTTACGTCCGATCCTGATQACCTCGCTGGCGnTATCCTCGGCGTT 120 

ATGCCGCTGGTTATCAGTACTGGTGCTGGnCCGGCGCGCAGAACGCAGTAGGTACCGGT 180 

GTAATGGGCGGGATGGTGACCGCAACGGTACTGGCAATCTTCnCGTTCCGGTAnCTn 240 

GTG6TGGTTCGCCGCCGCTTTAGCCGCAAGAATGAAGATATCGAGCACAGCCATACTGTC 300 

25 GATCATCAnGATACAACGIGTAATCACTAAGGCCGCGTAAGCGGCCmrnATGCATA 360 

ACCTACGAACATTAAGGAGTAAnGAACCACCAACTCAGGATCTCATACGAAAACCAGTA 420 

TTAACCACGGATAAAATTCATAAAAAATACTGAnGTTAGTTAATTTATATTAAGTAGCG 480 

CTAATAGATTTAATAATCCATAATCATTTAGAGGCTATTCTTAATTATTTGCGGTAATTC 540 

TTTAnCATTCCTCGGTTATTACGTCATAnCAGAGCAATCCTGGTATTAGTGTCACCAA 600 

30 TTTCATCTGGCGATAATCCT6AAATGTTATGAATAGTTCGAGCAAACTGCTTTTACCTGC 660 

TGCGGGTTAGTGCTAGTATGAAAAAGTGAGTCCTGTCCCGCTTCCTTCCTAATTGTAAn 7 2 0 

ITTCGTAATAATGCGATGAAAACCTGCAAAGAGTGGCnATAGnAAGCTAACAAACGAG 780 

AGGGCAAGTCCAGGTCAGTAAGTTTTTTCCATCCCGAAAGGTGTCCGTTAGnCAACCGC 840 

TAAGAAGGGGACGCGnATGGATGAATACTCACCCAAAAGACATGATATCGCACAGCnA 900 

35 AGTTTCTCTGTGAAACCCTGTATCATGACTGCCTTGCAAACCTTGAAGAAAGCAATCATG 960 

GCTGGGTAAACGACCCAACCTCGGCGATCAACCTCCAGnGAATGAACTGATTGAGCATA 1020 

TTGCGACCnCGCACnAATTACAAAATTAAGTATAATGAAGACAATAAGCTCATTGAGC 1080 

AGATCGACGAATATCTGGATGACACCTTTATGnGnCAGTAGTTATGGTAnAATATGC 1 140 

AGGATCnCAGAAATGGCGGAAGTCAGGTAAHCGACTATHCCGTTGTTTTGTCAATGCGA 1200 

40 CGAAAGAGAATCCTGCGAGTTTATCnGTTAGAAnATTACAACCATAGGTAGAAGTATG 1260 

TCCGAAAAACCTTTAACGAAAACCGATTATrTAATGCGTTTACGTCGTTGCCAGACAAn 1 320 

GACACGCTGGAGCGGTTTAWTCGAGAAAAATAAATACGAATTATCAGATAATGAACTGGC 1380 

GGTATTTTACTCAGCCGCAGATCACCGCCTCGCCGAATTGACCATGAATAAACTGTACGA 1440 

CAAGATCCCTTCCTCAGTATGGAAATTTATTCGCTAATAAATAATTCGCTTTCGGAGCTA 1500 

45 TAACCGGCTGTTTAnAAGAATnTATACTTTTTCGCCATGAAGACATACCCTATGTGAT 1560 
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CTnATCACACAGATGTAATGGGAACGTTCTCTTCACTGACTTTTCGTCTTACTGTGTTG 1620 

CCGCATTTTCAGCAACCGGAGTCAGTAATGAGCACAGAAAAAGAAAAGATGATTGCTGGT 168C 

MSTEKEKMIAG 

GAGTTGTATCGCTCGGCAGATGAGACGTTATCTCGCGATCGCCTGCGCGCTCGTCAGCTT 1 740 
5 ELYRSADETL5RDRLRARQL 

AnCACCGATACAATCATTCCCTGGCGGAAGAGCACACATTACGCCAGCAAATTCTCGCT 1800 

IHRYNHSLAEEHTLRQQILA 

GATCTAnCGGTCAGGTGACAGAGGCTTATATTGAGCCAACGTTTCGCTGTGACTATGGC 1860 

DLFGOVTEAYIEPTFRCDYG 

10 TATAACATTTTTCTCGGTAATAATTTTrTCGCCAACnCGATTGCGTGATGCTTGATGTC 1920 

YNIFLGNNFFANFDCVMLOV 

TGCCCTATTCGCATCGGTGATAACTGTATGTTGGCACCAGGCGnCATATCTACACGGCA 1980 

CPIRiGONCMLAPGVHlYTA 

ACACATCCCATCGACCCTGTAGCACGTAATAGCGGTGCTGAACTGGGGAAACCCGTCACC 2040 
15 THPIOPVARNSGAELGKPVT 

ATCGGTAATAACGTCTGGAnGGCGGACGCGCGGTCATTAACCCTGGTGTGACCATTGG^ 2 1 00 

IGNNVWIGGRAVINPGVTIG 

GATAACGTCGTGGTAGCCTCAGGTGCAGTTGTCACAAAAGATGTCCCGGACAACGTTGTC 2 1 60 

DNVVVASGAVVTKOVPDNVV 

20 GTGGGCGGTAATCCAGCCAGAATAATTAAAAAAnGTAATCGGTTTTTCGCAACTGTTAT 2220 

VGGNPARIIKKL 

GCAAAATTGTGGTAGATCTGnACTTCCCCTCTACTATTCCCACGTTAWTAGGGTGTT 2280 

CCCTGGAAAGTTGCAGATACCACGAAGGCAAACGATGACCGAAATACAACGCCTGCTGAC 2340 

CGAAACGATTGAGTCTCTGAATACCCGCGAAAAACGCGACAACAAACCCCGCTTTAGTAT 2400 

25 CAGTnTATCCGTAAACATCCGGGGCTGrrTATCGGTATGTACGnGCI 1 1 1 1 1 IGCCAC 2460 

CCTGGCGGTGATGTTGCAGTCCGAAACGCTGTCAGGCTCTGTCTGGCTACTGGTTGTATT 2520 

ATTTATCCTGCnAATGGmcnClTTTTCGATGTCTACCCACGCTACCGCTATGAAGA 2580 

TATCGACGTGCTGGATTTCCGCGTTTGCTATAACGGCGAATGGTACAACACGCGCTTTGT 2640 

ACCTGCCGCGCTGGTTGAAGCCATCnGAACTCTCCGTGTCGCGGATGTTCATAAGGAAC 2700 

30 AACTGCAAAAAATGATCGTCCGTAAAGGTGAACTGTCTnTTACGATAnTTTACCCTCS 2760 

TCGCGCCGAATCAACATCTTAAGTTAGGGnACATACCAGGCGTAAAGCTCTGCGCCTGG 2820 

TCAAATGACAATGATCGTTTCCACCCATCACnCATGAAATACCAGCTCTACCTCCnAT 2880 

CTCCAGCCAGCCTTTTTCCACAATCAGATATACTTTCCCTACACTGTGTTAATAAGGATA 2940 

TGCTGGTGAGAACACGACATCTGGTCGGCCTTATrTCGGGAGTACTGATTr.TTTCAGTAT 3000 

35 TGCTGCCTGTCGGCnAAGCATCTGGCTGGCCCATCAGCAGGTAGAAACATCGTTTATTG 3060 

AAGAGCTGGATACCTATTCCTCCCGCGTCGCTATTCGAGCCAATAAGGTGGCGACACAAG 3120 

GGAAAGATGCGCTGCAG 3137 



40 
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CLAIMS 

1. An enzyme having a(l,4) glucan acetyl-transferase activity, wherein the 
enzyme comprises the amino acid sequence shown as SEQ ID No. 1. or a variant, 
homologue or fragment thereof. 

2. A recombinant enzyme having a(l,4) glucan acetyl-transferase activity, 
wherein the enzyme comprises the amino acid sequence shown as SEQ ID No. 1, or 
a variant, homologue or fragment thereof. 

3. A recombinant enzyme having a(l,4) glucan acetyl-transferase activity, 
wherein the enzyme has the amino acid sequence shown as SEQ ID No. L 

4. A recombinant enzyme having a(l,4) glucan acetyl-transferase activity, 
wherein the recombinant enzyme is immunologically reactive with an antibody raised 
against a purified recombinant enzyme according to claim 3. 

5. A nucleotide sequence coding for the enzyme of any one of claims 1 to 4 or 
a sequence that is complementary thereto. 

6. A nucleotide sequence according to claim 5, wherein the nucleotide sequence 
is a DNA sequence. 

7. A nucleotide sequence comprising the sequence shown as SEQ ID No. 2, or 
a variant, homologue or fragment thereof or a sequence that is complementary 
thereto. 

8. A nucleotide sequence having the sequence shown as SEQ ID No. 2. 

9. A construct comprising or expressing the invention according to any one of 
claims 1 to 8. 
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10. A vector comprising or expressing the invention of any one of claims 1 to 9. 

II- A piasmid comprising or expressing the invention of any one of claims 1 to 
10. 

5 

12. A transgenic organism comprising or expressing the invention according to any 
one of claims 1 to 11. 

13. A transgenic organism according to claim 12, wherein the transgenic organism 
10 is a plant. 

14. A modified carbohydrate (preferably starch) prepared by a method comprising 
or expressing or using the invention according to any one of claims 1 to 13. 

15 15. An enzyme substantially as described herein. 
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^ ATGAGCACAGAAAAji\GAAA>iGA:GATTGCTGGT :d8G 

GAGnGTATCGCTCGGCAGATGAGACGTTArCTCGCGATCGCCTGCGCGC^CGTCAGCT: [IdQ 
ATTCACCGATACAATCATTCCCTGGCGGAAGAGCACACArTACGCCAGCAAArTCTCGC' L8CC 
GATCTATTCGGTCAGGTGACAGAGGCTTA'ATTGAGCCAACGTTTCGCTG'GACTATGGC 1360 
TATAACATTTTTCTCGGTAATAATTTTTTCGCCAACTTCGAIT^ 1920 
1 0 'GCCCTATTCGCATCGGTGATAACTGTATG7TGGCACCAGGCGTTCATATCTACACGGCA 1980 
ACACATCCCATCGACCCTGTAGCACGTAA7AGCGGTGCTGAACTGGC-GAAACCCGTCACC 2040 
ArCGGTAATAACGTCTGGATTGGCGGACGCGCGGTCAnAACCCTGGTGTGACCATTGGT 2100 
GATAACGTCGTGGTAGCCTCAGGTGCAGTTGTCACAAAAGATGTCCCGGACAACGTTGTC 2160 
G rCGGCGGTAA TCC AGGC AGAATAATTAAAAAA ITGTAA 

15 

-IGURE 2 

i^STE.<E<MlAGcLYRSADETLSRDRLRAR0LIHRYNHSLAEEHrLROQ:. 50 

:o 

AOLFGCVTi A Y I E OTFRCO YG YN [ FLGNNFFANFOC VMLDVC? I R ! GDf JC ICO 
MLAPGVHIYTATHPIDPVARNSGAELGKPVTIGNNVW[GGRAV[NPGVT[ 150 



25 GDNVVVASGAVVTKDVPDNVVVGGNPARI IKKL 



183 
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GAAnCGCCAAAGACTTGATGGA7AAAGAAGGTAAAGGTC'GATTGAAGCGACGC"TGA7 50 

5 GCGGTGCGGATGCGTTTACGTCCGATCCTGATGACCTCGCrGGCGTTTATCCTCGGCGTT :2Q 

ATGCCGCTGG^TArCAGTACTC'GTGCTGGTTCCGGCGCGCAGAACGCAGTAGGTACCGGT* :30 

GTAATGGGGGGGATGGTGACCGCAACGGTACTGGCAATCrrCTTCGTTCCGGTAnCTTT /40 

GTGGTGGTTCGCCGCCGCTTTAGCCGCAAGAATGAAGATATCGAGCACAGCCATACTGTC jQO 

GArCATCAT7GATACAACGTG7AATCACTAAGGCCGCGTAAGCGGC::TT-TTArGCATA 360 

LO ACCTACGAACATTAAGGAGTAArTGAACCACCAACTCAGGATCTCATACGAAAACCAG^A ^20 

rTAACCA.CGGATAAAATTCATAAAAAATACTGATTGTTAGTTAArTTATATTAAGTAGCG m 

CTAATAGATnAATAATCCATAATCAnTAGAGGCTATTCTTAAmTTTGCGGTAArT: 540 

TiTATTCAncCTCGGTTArTACGTCATArrCAGAGCAATCCTGGTATTAGTGTCACCAA 500 

mCATCTC-GCGATAATCCTGAAATGTOTGAATAGTTCGAGCAAAITGCTmACCTG 660 

1 5 ^GCGGG7TAGTGCTAGTATGAAAAAGTGA.GTCCTGTCCCGCTTCCnCCTAArTGTAATT 720 

rTTCGTAATAATGCGATGAAAACCTGCAAAGAGTGGCTTATAGTTAAGCTAACAAACGAG 780 

AGGGGAAG7CCAGGTCAGTAAG I I i M I CCATCCCGAAAGGTGTCCGTTAGTTCAACCGC m 

TAAGAAGGGGACGCGTTATGGATGAATACTCACCCAAAAGACATGATATCGCACAGCTTA 900 

AGTTTCTCTGTGAAACCCTGTATCATGACTGCCTTGCAAACCTTGAAGAAAGCAATCATG 960 

20 GCTGGGTAAACGACCCAACCTCGGCGATCAACCTCCAGTT6AATGAACTGATTGAGCATA 1020 

rTGCGACCTTCGCACTTAATTACAA^nAAGTAT.AATGAAGACAATAAGCTCATTGAGC 1080 

AGATCGACGAATATCTGGATGACACCTTTATGnGTTCAGTAGTTATGGTArTAATATGC 1140 

AGGATCTTCAGAAATGGCGGAAGTCAGGTAAHCGACTATHCCGTTGimGTCAATG^ 1200 

CGAAAGAGAATCCTGCGAGmATCnGmGAAnAmCAACC ATAGGTAGAAGTATG 1260 

25 7CCGAAAAA.CCmAACGAAAACCGAnAmAATGCGTTTACGTCGiTGCCAGACAA7T 1320 

GACACGCTGGAGCGGmAWTCGAGAAAAATAAATACGAATTATCAGATAATGAACTGG^ 1380 

GGTArrTTACTCAGCCGCAGATCACCGCCTCGCCGAATTGACCATGAATAAACTGTACGA 1440 

CAAGATCCCTTCCTCAGTATGGAAAmATTCGCTAATAAATAArrCGCrrrCGGAGCTA 1500 

TAACCGGCTGTTTATTAAGAAnTTATACTTmCGCCATGAAGACATACCCTATGTGAT 1560 

30 CTnATCACACAGATGTAATGGGAACGTTCTCnCACTGACTTTTCGTCTTACTGTGiTG :520 

CCGCATrrCAGCAACCGGAGTCAGTAATGAGCACAGAAMAGAAAAGATGATTGCTGGT 1630 

M 5 T £ < £ K M [ A G 

GAGTTGTATCGCTCGGCAGATGAGACGrTATCTCGCGATCGCCTGCGCGCTCGTCAGCTT 1740 
^LYRSAOETLSRDRLRARQL 

35 ATTCACCGATACAATCAnCCCTGGCGGAAGAGCACACATTACGCCAGCAAAnCTCGCT 1800 
IHRYNHSLAEEHTLRQQILA 

GATCTATTCGGTCAGGTGACAGAGGCTTATAnGAGCCAACGTTTCGCTGTGACTATGGC 1860 
DLFGOVTEAYIEPTFRCOYG 

TATAACATTrrrCTCGGTAATAATTTTrrCGCCAACTTCGATTGCGTGA 1920 
40 YNIFLGNNFFANFDCVMLDV 

TGCCCTATTCGCATCGGTGATAACTGTATGTTGGCACCAGGCGTTCATATCTACACGGCA 1980 
CPIRIGDNCMLAPGVHIYTA 

ACACATCCCATCGACCCTGTAGCACGTAATAGCGGTGCTG;j;CTGGGGAAACCCGTCACC 2040 
^HPIDPVARNSGAELGKPvr 

45 ATCGGTAATAACGTCTGGATTGGCGGACGCGCGGTCATTAACCCTGGTGTGACCATTGGT 2100 

tgnnvwiggravinpgvtig 
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GATAACGTCGTGG7AGCCTCAGGTGCAG7TGTCACAAAAWTGTCCCGGACAACGTTGTC 2 160 
CNVVVASGAVVTKOVPONVV 

5 GTGGGCGGTAATCCAGCCAGAArAATTAAAAAATTGTAATCGGTTTTTCGCAACTGrrAr 2220 
V G G N P A R I [ K K L 

GCAAAAn"GTGG7AGATCTG7TACnCCCCTCTACTATTCCCACGTTAAAATAGGG'GTT 2280 

CCCTGGAAAGTTGCAGATACCACGAAGGCAAACGATGA.CCGAAATACAACGCCTGCTGAC 2340 

CGAAACGATTGAGTCTCTGAATACCCGCGAAAAACGCGACAACAAACCCCGCTTTAGTAT 2400 

10 CAGirnATCCGTAAACATCCGGGGCTGTTrATCGGTATGTACGTTGCTTTTTTTGCCAC 2460 

CCTGGCGGTGATGTTGCAGTCCGAAACGCTGTCAGGCTCTGTCTGGCTACTGGnGTATT 2520 

ArrTATCCTGCTTAATGGTTTCnCTTTTTCGATGTCTACCCACGCTACCGCTATGAAGA 2580 

TATCGACGTGCTGGATTTCCGCGTTTGCTArAACGGCGAArGGTACAACACGCGCiTTGT 2640 

ACCTGCCGCGCTGGnGAAGCCATCTTGAACTCTCCGTGTCGCGGATGnCATAAGGA/iC 2 700 

15 -ACTGCAA/MATGATCGTCCGTAAAGGTGAACTGTCTTTTTACGArATTTTTACCCTCS 2760 

TCGCGCCGAATCAACATCTTAAGTTAGGGTTACATACCAGGCGTAAAGCTCTGCGCCTGG 2820 

rCAAATGACAA:GATCGiTTCCACCCATCACnCATGAAATACCAGCTCTACCTCCTTAT 2880 

CTCCAGCCAGCCTTTTTCCACAATCAGATATACTnCCCTACACTGTGTTAATAAGGATA 2940 

"GCTGGTGAGAACACGACATCyGGTCGGCCTTAnTCGGGAGTACTGATTCTTTCAGTAT 3000 

10 'GCTGCCrGTCGGCTTAAGCATCTGGCTGGCCCATCAGCAGGTAGAAACATCGTTTATTG 3060 

AAGAGCTGGATACCTAiTCCTCCCGCGTCGCTATTCGAGCCAATAAGGTGGCGACACAAG 3 120 

GGAAAGATGCGCTGCAG 21V 
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